Smart Paper Notes

HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization

Eduardo Conde-Sousa , João Vale , Ming Feng , Kele Xu , Yin Wang , Vincenzo Della Mea , David La Barbera , Ehsan Montahaei , Mahdieh Soleymani Baghshah , Andreas Turzynski

Category: Computer Vision

2021-11-08

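Breast cancer is the most common malignancy in women and is responsible for more than half a million deaths every year, so early and accurate diagnosis is essential. Human expertise is required to diagnose and correctly classify breast cancer and to define the appropriate therapy, which depends on evaluating the expression of different biomarkers such as the transmembrane protein receptor HER2. This evaluation requires several steps, including special techniques such as immunohistochemistry or in situ hybridization to assess HER2 status. With the goal of reducing the number of steps and the human bias in diagnosis, the HEROHE Challenge was organized as a parallel event of the 16th European Congress on Digital Pathology. It aimed to automate the assessment of HER2 status based only on hematoxylin and eosin stained tissue samples of invasive breast cancer. Methods to assess HER2 status were presented by 21 teams worldwide, and the results achieved by some of the proposed methods open potential perspectives for advancing the state of the art.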

Translated by Google Translate

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

Categories: Computer Vision | Machine Learning

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.


BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

Category: Natural Language Processing

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.


Cinématique d'une Prothèse de Main Myoélectrique Accessible avec Actionneur Unique et Rétropulsion Passive du Pouce

Côme Butin , Damien Chablat , Yannick Aoustin , David Gouaillier

Category: Robotics

2022-08-23

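This work presents new kinematics for a myoelectric hand prosthesis that uses a single actuator and can achieve a tridigital grip as well as a lateral grip. Inspired by tridigital prostheses, which are simpler, more robust and cheaper than polydigital prostheses, these kinematics aim to provide an accessible prosthesis (affordable, easy to use, robust, easy to repair). Cables are used instead of rigid rods to transmit the movement of the upper fingers and the thumb. The paper details the methods and design choices. To conclude, an evaluation of the prototype by an experimental user leads to a first discussion of the results.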


ScoreCAM GNN: une explication optimale des réseaux profonds sur graphes

Adrien Raison , Pascal Bourdon , David Helbert

Category: Machine Learning

2022-07-26

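The explainability of deep networks is becoming a central issue in the deep learning community. The same holds for learning on graphs, a data structure present in many real-world problems. In this paper, we propose a method that is more optimal, lighter, and more consistent than state-of-the-art methods, and that better exploits the topology of the evaluated graph.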


ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation dataset

Moritz Roman Hernandez Petzsche , Ezequiel de la Rosa , Uta Hanning , Roland Wiest , Waldo Enrique Valenzuela Pinilla , Mauricio Reyes , Maria Ines Meyer , Sook-Lei Liew , Florian Kofler , Ivan Ezhov

Category: Computer Vision

2022-06-14

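Magnetic resonance imaging (MRI) is a central modality for stroke imaging. It is used upon patient admission to make treatment decisions, such as selecting patients for intravenous thrombolysis or endovascular therapy. MRI is later used during the hospital stay to predict outcome by visualizing infarct core size and location. It can also be used to characterize stroke etiology, for example to differentiate between (cardio-)embolic and non-embolic stroke. Computer-based automated medical image processing is increasingly finding its way into clinical routine. Previous iterations of the Ischemic Stroke Lesion Segmentation (ISLES) challenge have helped to identify benchmark methods for acute and sub-acute ischemic stroke lesion segmentation. Here we introduce an expert-annotated, multicenter MRI dataset for the segmentation of acute to subacute stroke lesions. The dataset comprises 400 multi-vendor MRI cases with high variability in stroke lesion size, quantity and location. It is split into a training dataset of n = 250 and a test dataset of n = 150. All training data will be made publicly available. The test dataset will be used for model validation only and will not be released to the public. The dataset serves as the foundation of the ISLES 2022 challenge, whose goal is to find algorithmic methods that enable the development and benchmarking of robust and accurate segmentation algorithms for ischemic stroke.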


Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava , Abhinav Rastogi , Abhishek Rao , Abu Awal Md Shoeb , Abubakar Abid , Adam Fisch , Adam R. Brown , Adam Santoro , Aditya Gupta , Adrià Garriga-Alonso

Categories: Natural Language Processing | Artificial Intelligence | Machine Learning | Machine Learning (Statistics)

2022-06-09

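Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. To inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 442 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting.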


The CAMELS project: public data release

Francisco Villaescusa-Navarro , Shy Genel , Daniel Anglés-Alcázar , Lucia A. Perez , Pablo Villanueva-Domingo , Digvijay Wadekar , Helen Shao , Faizan G. Mohammad , Sultan Hassan , Emily Moser

Categories: Artificial Intelligence | Machine Learning

2022-01-04

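The Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project was developed to combine cosmology with astrophysics through thousands of cosmological hydrodynamic simulations and machine learning. CAMELS contains 4,233 cosmological simulations, 2,049 N-body and 2,184 state-of-the-art hydrodynamic simulations, which sample a vast volume in parameter space. In this paper we present the CAMELS public data release, describing the characteristics of the CAMELS simulations and a variety of data products generated from them, including halo, subhalo, galaxy, and void catalogues, power spectra, bispectra, Lyman-α spectra, probability distribution functions, halo radial profiles, and X-ray photon lists. We also release catalogues containing billions of galaxies from CAMELS-SAM, a large collection of N-body simulations that have been combined with the Santa Cruz Semi-Analytic Model. We release all the data, comprising more than 350 terabytes and containing 143,922 snapshots, millions of halos, galaxies and summary statistics. We provide further technical details on how to access, download, read, and process the data at https://camels.readthedocs.io.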


OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion

Vittorio La Barbera , Fabio Pardo , Yuval Tassa , Monica Daley , Christopher Richards , Petar Kormushev , John Hutchinson

Categories: Robotics | Machine Learning

2021-12-11

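Muscle-actuated control is a research topic of interest spanning different fields, in particular biomechanics, robotics and graphics. This type of control is particularly challenging because models are often overactuated, and the dynamics are delayed and non-linear. It is, however, a very well tested and tuned actuation model that has undergone millions of years of evolution and that involves interesting properties, exploiting the passive forces of muscle-tendon units and efficient energy storage and release. To facilitate research on muscle-actuated simulation, we release a 3D musculoskeletal simulation of an ostrich based on the MuJoCo simulator. Ostriches are among the fastest bipeds on earth and are therefore an excellent model for studying muscle-actuated bipedal locomotion. The model is based on CT scans and dissections used to gather actual muscle data such as insertion sites, lengths and pennation angles. Along with this model, we also provide a set of reinforcement learning tasks, including reference motion tracking and a reaching task with the neck. The reference motion data are based on motion capture clips of various behaviors, which we pre-processed and adapted to our model. This paper describes how the model was built and iteratively improved using the tasks. We evaluate the accuracy of the muscle actuation patterns by comparing them to experimentally collected electromyographic data from locomoting birds. We believe that this work can be a useful bridge between the biomechanics, reinforcement learning, graphics and robotics communities by providing a fast and easy-to-use simulation.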


Compresion y analisis de imagenes por medio de algoritmos para la ganaderia de precision

David Agudelo Tapias , Simon Marin Giraldo y Mauricio Toro Bermudez

Category: Computer Vision

2021-11-19

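The problem we want to solve in this project for the Data Structures and Algorithms course is the interpretation of certain images, specifically images of cattle. The task is to identify whether an animal is healthy, that is, whether it is in good enough condition to be considered in the cattle selection process, or whether it is sick, in which case it is to be discarded. A compression algorithm allows the images to be taken and passed to this code for examination. The result will not always be one hundred percent accurate, but the code is efficient because it works with machine learning, which means that the more information it receives, the more accurate its results become. The proposed algorithms are NN and bilinear interpolation, with which significant results were obtained in execution speed. We conclude that a better job could have been done, but we believe that what was delivered is a good result.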
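As a rough illustration of one of the techniques named in this abstract, the following is a minimal sketch of bilinear interpolation used to resize a grayscale image. It is not the authors' code: the function name `resize_bilinear`, the list-of-rows image representation, and the corner-aligned sampling scheme are all assumptions made for this example.

```python
def resize_bilinear(img, new_h, new_w):
    """Resize a grayscale image (list of equal-length rows) to new_h x new_w.

    Hypothetical helper for illustration; uses corner-aligned sampling,
    so the four output corners coincide with the four input corners.
    """
    old_h, old_w = len(img), len(img[0])
    # Step sizes that map output indices onto input coordinates.
    sy = (old_h - 1) / (new_h - 1) if new_h > 1 else 0.0
    sx = (old_w - 1) / (new_w - 1) if new_w > 1 else 0.0
    out = []
    for i in range(new_h):
        y = i * sy
        y0 = int(y)                   # row above the sample point
        y1 = min(y0 + 1, old_h - 1)   # row below, clamped to the image
        wy = y - y0                   # vertical blend weight
        row = []
        for j in range(new_w):
            x = j * sx
            x0 = int(x)                   # column left of the sample point
            x1 = min(x0 + 1, old_w - 1)   # column right, clamped
            wx = x - x0                   # horizontal blend weight
            # Blend horizontally on both rows, then vertically between them.
            top = img[y0][x0] * (1 - wx) + img[y0][x1] * wx
            bottom = img[y1][x0] * (1 - wx) + img[y1][x1] * wx
            row.append(top * (1 - wy) + bottom * wy)
        out.append(row)
    return out


if __name__ == "__main__":
    small = [[0, 10], [20, 30]]
    for line in resize_bilinear(small, 3, 3):
        print(line)
```

Calling the same function with a target size smaller than the input shrinks the image, which is the direction a compression step would use; the abstract does not say which sampling convention the authors chose.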
